1,975 questions with Azure Databricks tags

Sort by: Updated
1 answer

Delete the file from SharePoint location

Hi All, I am trying to copy the files from Share Point to ADLS and referring to the below URL pipeline to achieve the copy functionality. https://www.syntera.ch/blog/2022/10/10/copy-files-from-sharepoint-to-blob-storage-using-azure-data-factory/ I need…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,374 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,776 questions
asked 2024-05-22T10:16:27.3866667+00:00
ADF_Coder 0 Reputation points
commented 2024-05-28T16:49:14.18+00:00
ADF_Coder 0 Reputation points
4 answers

how to resolve this error while getting data from databricks from power bi.Not able to load table in power bi

DataSource.Error: ODBC: ERROR [HY000] [Microsoft][DSI] (20039) Cannot store ""."".""."REMARKS" value in temporary table without truncation. (Column metadata implied a maximum of 512 bytes, while provided value is…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2023-10-23T18:45:35.15+00:00
Pratibha Khare 20 Reputation points
answered 2024-05-28T15:31:06.68+00:00
Mike 0 Reputation points
0 answers

Questions on Azure Databricks prepurchase plan

Hello, Our customer had purchased Azure Databricks prepurchase plans but their utilization is not even 50% over 3 months and their expiry date is 15 months from today. Could you please clarify on following questions? Why Azure Databricks prepurchase…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-28T09:35:55.2666667+00:00
Anil Kumar 200 Reputation points
edited the question 2024-05-28T14:17:05.2633333+00:00
SadiqhAhmed-MSFT 39,236 Reputation points Microsoft Employee
0 answers

Is there a way to restrict multiple instances of an ADF pipeline on Same path of event based trigger?

I'm running Spark notebooks in Synapse using an event-based ADF pipeline. If any notebook gets triggered twice, I encounter a conflict error. I understand that I can avoid concurrent execution of the pipeline by setting the concurrency to 1. However, I…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,484 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,776 questions
asked 2024-05-28T11:16:32.32+00:00
Divya Sharma 1 Reputation point
edited a comment 2024-05-28T12:42:32.97+00:00
Harishga 4,250 Reputation points Microsoft Vendor
1 answer

Are High Concurrency clusters deprecated or renamed in UC databricks worskpace

Hello Team, Is the High concurreny clusters deprecated. Even I don't see Custom Access mode in UC enabled databricks workspace UI. I went through this article https://learn.microsoft.com/en-us/azure/databricks/archive/compute/cluster-ui-preview but I am…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-28T10:02:40.75+00:00
Ashwini Gaikwad 65 Reputation points
answered 2024-05-28T11:07:14.4533333+00:00
PRADEEPCHEEKATLA-MSFT 80,096 Reputation points Microsoft Employee
2 answers

How to ignore the records in ADF Data Flows

Hi All I am building a data transamination using mapping data flows ,I have a time stamp field Like TimeStampUpdated in the target table. I want to lockup historical data with incremental data transamination and ignore the records coming in the…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,484 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,776 questions
asked 2024-05-23T06:58:12.53+00:00
venkat rao 45 Reputation points
commented 2024-05-28T08:18:49.66+00:00
ShaikMaheer-MSFT 38,206 Reputation points Microsoft Employee
2 answers

Firewall Configuration for Custom Model Serving in Azure Databricks

Hi, I am encountering an error when trying to serve my custom LLM model endpoint. The error message reads: "Container image creation failed, see Build Logs for details. If there are no build logs, the failure may be due to storage firewall…

Azure Storage Accounts
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,768 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-23T10:43:43.7833333+00:00
Pejman Memar 0 Reputation points
answered 2024-05-28T06:33:26.23+00:00
Nehruji R 3,041 Reputation points Microsoft Vendor
2 answers

Azure to AWS

Hello We need to transfer files from ADLS to AWS (S3 bucket) for a SAS application hosted in third party in batches. We need to ensure data security and best practices. My understanding, we can use ADF to create a linked service for AWS S3 but IT DOES…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,374 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,776 questions
asked 2024-05-20T08:39:35.8366667+00:00
Sourav 80 Reputation points
edited a comment 2024-05-28T05:16:05.3666667+00:00
Sumarigo-MSFT 44,096 Reputation points Microsoft Employee
2 answers

How to setup modern Arcitechure for Small/Medium Business?

Currently we're using the following setup which is slow to process the data and is slow on the power bi side: Azure VM for third parties to upload via sftp C# script to ETL data to azure sql server and move files to ADLS Gen2 Power BI report pulling…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,374 questions
Azure Virtual Machines
Azure Virtual Machines
An Azure service that is used to provision Windows and Linux virtual machines.
7,290 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-23T20:55:59.0633333+00:00
Jordan 5 Reputation points
answered 2024-05-28T04:54:01.1433333+00:00
Sumarigo-MSFT 44,096 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

How do I figure out what public IP ranges my Databricks workspace clusters are coming from?

Edit: I am rewriting this to clarify the ask. Relatively new to Databricks. I am trying to understand how outbound traffic from clusters is determined. It seems to differ if SCC is enabled vs when it's not. With no SCC: VMs start up with a…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-08T22:13:53.72+00:00
McDonald, Matthew 101 Reputation points
commented 2024-05-28T02:06:14.0233333+00:00
PRADEEPCHEEKATLA-MSFT 80,096 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Invalid records failed in DQ checks

We are capturing the records that failed in DQ checks by using Databricks in the Blob storage for business owners to resolve inconsistencies, we have added an extra column as DQ checks failed reason. I have the following: What if the particular record…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-24T10:55:51.94+00:00
Anshal 2,006 Reputation points
accepted 2024-05-27T09:32:56.4466667+00:00
Anshal 2,006 Reputation points
1 answer One of the answers was accepted by the question author.

Databricks SQL endpoint

Hi friends, where does the databricks SQL endpoint stand with comparison to other data warehousing technologies such as Synapse, snowflake, and google cloud? please provide metrics related comparison in terms of costs,scalability and performance. Which…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-25T14:17:43.1666667+00:00
Anshal 2,006 Reputation points
accepted 2024-05-27T09:10:36.28+00:00
Anshal 2,006 Reputation points
4 answers One of the answers was accepted by the question author.

What is the difference between Databrick prepay and Databrick reservation in Azure ?

Hello, We are just considering ways to reduce Databrick cost in Azure other than buying RI for VMs behind Databrick clusters. What is the difference between Databrick prepay and Databrick reservation in Azure It seems Databrick reservation is named as…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-23T09:26:44.82+00:00
Anil Kumar 200 Reputation points
accepted 2024-05-27T05:32:31.28+00:00
Anil Kumar 200 Reputation points
2 answers

Why isn't code working as an expression for parameters in SQL Server Reporting with connection to Databricks SQL Warehouse using Simba Spark

Error: For more information about this error navigate to the report server on the local server machine, or enable remote errors ---------------------------- Query execution failed for dataset 'DataSet1'. (rsErrorExecutingCommand)…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
SQL Server Reporting Services
SQL Server Reporting Services
A SQL Server technology that supports the creation, management, and delivery of both traditional, paper-oriented reports and interactive, web-based reports.
2,827 questions
asked 2024-05-24T20:44:13.19+00:00
Maxwell, Niki 0 Reputation points
answered 2024-05-27T05:21:01.1466667+00:00
ZoeHui-MSFT 33,701 Reputation points
1 answer

Cluster Start-up Delayed. Please wait while we continue to try and start the cluster. No action is required from you. (cluster-id 0524-002352-kk357210-v2n)

Hello good people, I am getting this error "Cluster Start-up Delayed. Please wait while we continue to try and start the cluster. No action is required from you. (cluster-id 0524-002352-kk357210-v2n)" Please help. Thank You so much.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-24T00:53:43.1466667+00:00
Asma Khalid 0 Reputation points
commented 2024-05-27T04:08:21.1966667+00:00
PRADEEPCHEEKATLA-MSFT 80,096 Reputation points Microsoft Employee
1 answer

I don't see the Data tab in my 14-day trial for Azure databricks.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-03-13T01:54:21.8533333+00:00
Venkata Subba Reddy Bovilla 5 Reputation points
edited a comment 2024-05-26T22:14:16.96+00:00
Kulkarni, Gargi Renukadas 0 Reputation points
1 answer

How to use a different version of a Spark Java library dependency (antlr4) in a Databricks notebook?

Hello. I need to use in a Databricks notebook a custom made Java library which depends on Drools v8.40.1.Final which depends on ANTLR4 v4.10.1. When I try to invoke a method in my Java library I get the following error: "ANTLR Tool version 4.10.1…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-01-22T22:32:40.6433333+00:00
Martin Medina 5 Reputation points
commented 2024-05-24T16:55:41.88+00:00
Carlos Irazabal 0 Reputation points
2 answers

How to reduce unnecessary high memory usage in a Databricks cluster?

We are having unnecessary high memory usage even when nothing is running on the cluster. When the cluster first starts, it's fine, but when I run a script and it finishes executing, nothing gets back to the idle (initial) state (even hours after nothing…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-08T08:58:46.4433333+00:00
Senad Hadzikic 20 Reputation points
answered 2024-05-24T12:27:24.49+00:00
Ben Gislason 0 Reputation points
0 answers

How to parse nested json array of document in ADF data flow

Hi all I am trying to fitch the values from a nested josh array of document , I have used aggregate to convert into objects but not able to fitch the values of all child nodes like as below itOffer.item itOffer.item.SplOfr itOffer.item.buy …

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,484 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,776 questions
asked 2024-05-02T15:44:57.8633333+00:00
venkat rao 45 Reputation points
commented 2024-05-24T09:38:47.0233333+00:00
phemanth 6,810 Reputation points Microsoft Vendor
1 answer

Azure Databricks workflow job failure

We have a stream workflow job that run 24*7 and loads the data in delta table for say: raw.deltaTableA Now, the problem is in case we are trying to optimize this delta (optimize raw.deltaTableA) table while the table is getting loaded we get frequent…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,975 questions
asked 2024-05-02T08:23:36.4266667+00:00
NIKHIL KUMAR 101 Reputation points
commented 2024-05-24T04:40:08.6533333+00:00
PRADEEPCHEEKATLA-MSFT 80,096 Reputation points Microsoft Employee